Exploiting Semantic Proximity for Information Retrieval
نویسندگان
چکیده
In this paper, we propose a method which exploits the semantic proximity of words in unrestricted natural language text to retrieve relevant documents. In order to facilitate this functionality, the system represents the documents and the query in the form of semantically relatable sets (SRS), which are a group of entities demanding semantic relations when the semantic representation of the sentence is ultimately produced. We also devise a method to augment the SRSs to further boost the performance. WordNet is used to deal with different forms of divergence between the query and the documents. In a series of experiments on TREC data, our semantic proximity based retrieval technique yields high precision with improved mean-average-precision in comparison to conventional retrieval techniques.
منابع مشابه
Exploiting Semantic Features for Image Retrieval at CLEF 2005
This paper presents the MIRACLE’s team approach to text-based image retrieval at ImageCLEF 2005 adhoc task. The experiments defined this year try to use semantic information sources, like semantic dictionaries or text structure. For this purpose EuroWordnet has been considered and a new algorithm to extract synonyms from the semantic database has been developed. This new algorithm implementatio...
متن کاملExploiting Semantic Annotations for Entity-based Information Retrieval
In this paper, we propose a new approach to entity-based information retrieval by exploiting semantic annotations of documents. With the increased availability of structured knowledge bases and semantic annotation techniques, we can capture documents and queries at their semantic level to avoid the high semantic ambiguity of terms and to bridge the language barrier between queries and documents...
متن کاملExploring and Exploiting Proximity Statistic for Information Retrieval Model
Proximity among query terms has been recognized to be useful for boosting retrieval performance. However, how to model proximity effectively and efficiently remains a challenging research problem. In this paper, we propose a novel proximity statistic, namely Phrase Frequency, to model term proximity systematically. Then we propose a new proximity-enhanced retrieval model named BM25PF that combi...
متن کاملPublic Transport Ontology for Passenger Information Retrieval
Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...
متن کاملSemantic Information Filtering - Beyond Collaborative Filtering
In this paper we introduce our idea of a semantic information filtering system. Contrary to traditional information filtering systems exploiting information retrieval techniques to select relevant data, we propose a workflow exploiting semantic information obtained from the web. Our system utilises the structured information crawled from the semantic web to improve the process of extracting the...
متن کامل